Multiple Imputation: Better than Single Imputation in Pain Studies?

نویسنده

  • Ashik Chowdhury
چکیده

Missing data is one of the genuine challenges we face during analysis of clinical trials. The main impact of missing data is that it can infuse bias in results which reduces the chance of getting the appropriate interpretation. Hence proper knowledge of techniques for handling missing data is crucial. A common method of handling this problem is by imputing missing values. There can be single or multiple imputations. While single imputation methods like LOCF or BOCF have been widely used, multiple imputation is of a comparatively recent origin but is gaining popularity. The main reason behind its popularity is it overcomes some of the drawbacks of single imputation. The current paper presents comparative effectiveness of each with the help of real life cases involving pain studies. INTRODUCTION The most serious concern of missing data is that it can introduce bias into the results of a clinical study, particularly if the data are missing not at random. Missing data causes the loss of information and in turn loss of statistical power. So the major challenge for these pain studies becomes interpreting the results. It is often preferred to put maximum effort at the design stage to achieve complete capture of all data to minimize the bias and maximize the power (e.g. continue collecting pain outcomes even after subject discontinuation). However, even with the best designed study missing data is still a potential issue. As such, various imputation methods are often employed when handling data of this nature. There are a few standard procedures to impute missing data. Single imputation usually identifies a particular record for a subject, e.g. baseline or just the previous non-missing value and repeats it for the missing data points. Multiple imputation uses a predicted value for a given subject and time point using statistical modelling of available data. Osteoarthritis (OA) is a painful and debilitating musculoskeletal disease that is characterized by intra-articular (IA) inflammation. In these studies pain is typically measured on a daily basis following study drug injection for a prespecified length of time. Pain intensity is generally measured by self-assessment with validated methods such as the visual analogue scale (VAS) or numeric al rating scale (NRS, 0-10). Depending on the length of follow-up of these studies and the fact that pain assessments are collected daily, one potential problem is the frequent occurrence of missing data. This paper will illustrate several different imputation methods that are possible for handling missing data in pain studies. It will also explore the challenges faced in both of the imputation techniques, single and multiple, using a comparative study. REGULATORY GUIDANCE There is no universally applicable method of handling missing data recommended by any regulatory body. Existing regulatory guidance does not have specific instruction on how to address the problem of missing data. Realizing this gap the U.S. Food and Drug Administration (FDA) created a panel on Handling of Missing Data in Clinical Trials. This panel prepared a report which summarized how to reduce the amount of missing data and appropriate statistical methods to address missing data for analysis. Table 1 displays some selected recommendations from that report (The Prevention and Treatment of Missing Data in Clinical Trials, Panel on Handling Missing Data in Clinical Trials; National Research Council)

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Accuracy evaluation of different statistical and geostatistical censored data imputation approaches (Case study: Sari Gunay gold deposit)

Most of the geochemical datasets include missing data with different portions and this may cause a significant problem in geostatistical modeling or multivariate analysis of the data. Therefore, it is common to impute the missing data in most of geochemical studies. In this study, three approaches called half detection (HD), multiple imputation (MI), and the cosimulation based on Markov model 2...

متن کامل

An Empirical Comparison of Performance of the Unified Approach to Linearization of Variance Estimation after Imputation with Some Other Methods

Imputation is one of the most common methods to reduce item non_response effects. Imputation results in a complete data set, and then it is possible to use naϊve estimators. After using most of common imputation methods, mean and total (imputation estimators) are still unbiased. However their variances (imputation variances) are underestimated by naϊve variance estimators. Sampling mechanism an...

متن کامل

چند رویکرد برخورد با مقادیر گمشده‌ متغیرهای کمی و بررسی اثر آنها بر نتایج حاصل از یک کارآزمایی‌ بالینی

Background and Objectives: A major challenge that affects the longitudinal studies is the problem of missing data. Missing in the data may result in the loss of part of the information which reduces the accuracy of the estimator and obtain the results will be biased and inaccurate. Therefore, it is necessary to evaluate the missing data mechanism from a longitudinal research and to consider thi...

متن کامل

کاربرد جای گذاری چندگانه در تحقیقات پزشکی و اپیدمیولوژی

Data missing, which occurs for different reasons, is an unavoidable problem in epidemiological studies. It is quite widespread and, therefore, it is considered as a challenge in research design and data analysis by many methodologists. Complete case analysis is often used in studies with missing data however, this approach may result in inaccurate estimates and inferences due to bias associated...

متن کامل

Effect of Reference Population Size and Imputation Methods on the Accuracy of Imputation in Pure and Mixed Populations

    Imputation as a method of creating low-density chips to high-density chips has been introduced to increase the accuracy of genomic selection in animals. In the current study, to investing imputation accuracy, three populations of mixed (scenario 1), pure (scenario 2) and mixed + pure (scenario 3) were simulated using QMSim. Two methods of imputation including Beagle and Flmpute were used fo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014